(57) Abstract

A method for efficiently screening full-length cDNA clones, which comprises: determining the base sequence of the 5'-region of each
clone contained in a cDNA library prepared by a method that yields a high proportion of full-length cDNAs;
examining this 5'-region for the presence and location of an initiation ATG by using originally developed software for
predicting initiation codons in cDNA, thereby accurately judging whether the initiation codon is present and where it lies; and
selecting from the cDNA library the cDNAs judged to carry the initiation codon. Moreover, a cDNA library containing full-length
cDNAs at an extremely high ratio can be constructed by mixing the clones thus selected.
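
The selection step described in the abstract can be sketched as follows. This is a minimal illustration in Python and not the software the document refers to: `screen_clones`, its parameters, and `toy_scorer` are hypothetical names, the cut-off is an assumed user-chosen value, and the initiation-codon predictor is passed in as a callback because its internals are not reproduced here.

```python
from typing import Callable, Dict, List


def screen_clones(
    five_prime_reads: Dict[str, str],
    score_atgs: Callable[[str], List[float]],
    threshold: float,
) -> List[str]:
    """Return IDs of clones whose best ATG score reaches the threshold.

    five_prime_reads maps a clone ID to its 5'-end read.
    score_atgs stands in for an initiation-codon predictor and returns
    one score per ATG found in the read (assumed interface).
    threshold is an assumed cut-off chosen by the user.
    """
    selected = []
    for clone_id, read in five_prime_reads.items():
        scores = score_atgs(read)
        # Keep the clone only if at least one ATG scores high enough.
        if scores and max(scores) >= threshold:
            selected.append(clone_id)
    return selected


def toy_scorer(read: str) -> List[float]:
    """Placeholder scorer for demonstration only: every ATG gets 0.9."""
    return [0.9] * read.upper().count("ATG")


if __name__ == "__main__":
    reads = {"clone_a": "CCACCATGGCG", "clone_b": "CCCCCCCCCCC"}
    print(screen_clones(reads, toy_scorer, threshold=0.5))
```

Pooling the clones returned by such a filter corresponds to the library-construction step mentioned in the last sentence of the abstract.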
5££§©cDNA£ o - > £36* <fc < f# § C £ &mW 7. =r v 7 1 & § o 

- jtLiij, m&n mm mm, 38, 476 -48i (1993)., • tu, 

M6ft filffc, 41, 603-607 (1996)., M. Maruyama and S. Sugano, Gene, 
138, 171-174 (1994).), * U zS y 7j£i:0kayam-Berg&&*fi#'&to-lirT^ 
fgLfceftiltf'J y7*?£(S. Kato etal., Gene, 150, 243-250 (1994)., to 
M • IPS ^H¥6-153953-^S$S 1994^6330), CAPtCDNAU > 13 t 
3 'J >*— -ft3Mg£i£(N. Merenkova and D. M. Edwards, W0 96/34981 Nov. 7, 
1996.), CAPCDK^-f->^ftit:J:S+^y7 , ^iifS(j*(P. Carninci et al., G 
enomics, 37,327-336 (1996)., P. Carninci et al., DNA Research, 4, 61-66 
(1997).)Tfc6o Zh%&T^xnm^<DmMk(DCK?*mmtZ>fi&X &t>, 

-5 d h££ »)^gcDNA©Jt*©S5l^-f 7*7 U ft 1ST tTIi, 5' 
m (I.Edery et al. ,MCB, 15, 3363-3371(1995)) tz¥1$ l 8\t>*iX l^£„ Cti^fel^ 
(DlsmtLX\±, fg-|icDNA©5'$!ffl£C-7M U>7UTCap Switch oligonucleo 
tide^T — — ;b^-y-^)Cap Switch oligonucleotide^Tfo&Cap Finder (Clonte 

^TcDNA^-1' 77 U-^ft^Lfe^(3(± s ^%±&(D>7U-yft.&t&Z>mM 

^i§'»-e^D-><b-r^fc4?)(i, cdna^ 77 v-izsttiz&tn-ytf^ 

S^cDNA7^77 U-©^^^^*^^^,- fc£tmtTSo 

cduizmmm^ %z.tfre>* mmmft? F><D&&<D%mzm 

mt^Xs cDNA7 -i 77 U-a»£»*ttfc55±gcDNAfcS!RT3 

H^cDNAt^ 7*7 U-tf^&T-cDNA^ 77 5'«J;WSiI 
©DNA JiSE5»J£ifei£U 3©^$|*jtipR!lten h* >#££lT3*>gfr£»*lT 

(DcWktu-yzntZZttfxg&t^Z-tzo 

$IT £ 7 □ 7 7 A (I P L T Ji, 15 1 A, £H38 # & £ *lT ^ © #IjM£T' & 3 
($Rn#'Jh r A. G. Pedersen, Proceedings of fifth international c 

onference on intelligent systems for molecular biology, p226-233 (1997), 
held in Halkidiki, Greece, June 21-26, 1997. j #W £>ft£) 0 x*v>£ 
^mt Z>7°v 7? ■klz^\^x\Zffl%£tiZ$X&i^Z>tf ( i"Gene Finderj V. V. 
Solovyev et-al., Nucleic Acids Res., 22, 5156-5163 (1994)., fGrailj Y - 
Xu et al., Genet-Eng-N-Y. , 16, 241-253 (1994)) , uCrD^A^l^ 

flt^ $c±mm<£>m)<^\}M? <i 77 U — f^EtSciSTf^MLfccDNA^ 77'J-i: 

Hfrfttif;*:, £1\ ^U3>v^7^T^McDNA^^7 r 7U-^{tML, cDNA 
— (Z'a^ti-S^ < ofr©£ d — >{;oi/NT-?-©5' J&SiS^J £ 

£ « 5' mmcD&fez titzi&mm iz o ^ mmm^ f>©mj;^© 

© ^ h - Sir c J: ^ m v ^ fd t o 

#£T£^J££ftfc^o->£cDNA^77 U-^5>3i^rti(i\ ^mz% 
±^©cDNA^P->^3i^K-r^Ci:^"pI^T'fe^^i:$:Mv^^LAco £&fc^ * 
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NA5^ 7*5 *)—*ft!&-Z%ZZt*fi^1ZLtzo 

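That is, the present invention relates to a method for selecting full-length cDNA clones from a cDNA library, namely:

(1) a method for selecting a full-length cDNA clone, comprising

(a) determining the nucleotide sequence from the 5'-end of the cDNA clones contained in a cDNA library,

(b) judging, by means of a program that predicts translation initiation codons, whether or not the nucleotide sequence determined in (a) contains a translation initiation codon, and

(c) selecting the clones judged in (b) to contain a translation initiation codon;

(2) the method according to (1), wherein the cDNA library has been prepared by a method for constructing a cDNA library with a high proportion of full-length clones;

(3) the method according to (1), wherein the cDNA library has been prepared by a method comprising a step of modifying the CAP of mRNA;

(4) a method for constructing a full-length cDNA library, comprising

(a) determining the nucleotide sequence from the 5'-end of the cDNA clones contained in a cDNA library,

(b) judging, by means of a program that predicts translation initiation codons, whether or not the nucleotide sequence determined in (a) contains a translation initiation codon,

(c) selecting the clones judged in (b) to contain a translation initiation codon, and

(d) mixing the clones selected in (c);

(5) the method according to (4), wherein the cDNA library has been prepared by a method for constructing a cDNA library with a high proportion of full-length clones;

(6) the method according to (4), wherein the cDNA library has been prepared by a method comprising a step of modifying the CAP of mRNA; and

(7) a full-length cDNA library obtainable by the method according to (4).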

cDNA5^75U-. !ftt3^£WE>K^cDNA5^ ^7 U-©5'Mi5i 

it). »*fl«Jt:±gcDNA^n->**«1-a^i:3b$-e#x £ S>fclM!l/fccDNA* 
D->*S^-ra^i;t:«fc»)^»4^?)t:S«)fccDNA7'f 75 U-^ffMr 

NA£D->£3HR-r3#&fci N (a) cDNA^-f 75 'J -C^£ti£cDNA^ d-> 
©5'^/b^Ml^iJ£&&T3X*i, (b) &££ftfcigSIJ5»J + fc:ia*&:3 h* 

is, (c) gaifezj H>tf^$tiTi^i:*ij^$nfc^D->s3HR-rs 

M£^iro *l6W©^±mcDNA7>f y^'J-Of^Sa^H:, (d) 
gft $ tt £ * □ - > * rl^-r 5 IS * ^tf o 

tLttt^ ^CftlUJi&V^. *D->#^£ft4£©«<fcV^>f 75 'J-Cfi 

ilLT^ScDNAO^dtO^^fiT-fSJi^^^nSo 3©fc#>cDNA£ D-> 
ii. ±I3©^£M©^cDNA5 ^ 75 'J-©m£&, 0"]*.l;JU mRNA©CAP(iRNA 

u >*-%»*wt:iB^-rs*u^^y7i*(#» • &lu, m&n mm. mm, 

38, 476 -481 (1993).. 11611 ft& 41, 603-607 (1996)., 

M. Maruyama and S. Sugano, Gene, 138, 171-174 (1994).). ^-'Jn>t v 7i£ 
^0kayama-Bergri$:^-&^-«-TPf^Lfc3jt^7l-U n>^^7^(S. Kato et a 
1., Gene, 150, 243-250 (1994).. MB • FMU ^¥6-153953 1994^6330). 
CAPtDNAU >*~*<b^!K^-rS U >*-<b^*g£8(N. Merenkova and D. M. 
Edwards, W0 96/34981 Nov. 7, 1996.). CAP© £3*-^ y 7{b^ 

f^i£(P. Carninci et al., Genomics, 37,327-336 (1996).. P. Carninci et 
al., DNA Research, 4, 61-66 (1997).), #$£fcttHela8IIB&©Capg£* 
Kfcffl^S^S (I.Edery et al.,MCB, 15,3363-3371(1995)) N Cap Switch olig 
onucleotide£T-&£Cap Finder £ X *) ftSt^tifc^ >f 7^ U £ £:# 

cDNA^^^U— *»?>©cDNA^D — >0#d{±. (J.Sambrook.E.F.Frits 
ch & T.Maniatis, Molecular Cloning Second edition, Cold Spring Harbor Labo 
ratory Press 1989) fc£C!3«©^?££<fc Dfr-Sdfctf-CSSo 

A X & © DN A */ - T > * IS H £ DN A { y - >T > V - m £ ffl ^ T ft & t <fc 0 fi= 9 £ i: A 5 
figK*Jtt±&*1]*&£*S 1 &gli&<^ 5' 100(Nfi£gl££& 

^nrtfg^ift^L^L fci£S15?'J + C BM& 3 H > ##S-r S t flit 
cDNA^P->(±. Sl&fcJU 5c£ficDNAT-& »K X 3 T*M£^i£*^ Li£& 

lEWtH^ K>*q££LfcV^J£3nfccDNA£n->fcfc N T^ftcDNAT' 

£ ft fccDNA ^ o - > fcgiR-T SCiiCi^ #J t 5t£ScDNA£ S d i: 
3b*qrtlt-*So HKitilUte^J 1 tlB«©-7°py7A^ffl^fcM*ff©-^JT-(i, £ 
^m** 5 51%©cDNA7^^7 U-^^D->©^tfflV^fc^, ^37 (ftiftiS 
0.94) £0.5J^±T^p->£^KT3^p->©^£SM71L ^3^*0. 
70JUJ:tf ^P->Sa^1"Si:^o->©^S*tt77JU X3r^0.80J^±T-^ 
n->&3HRTS — >©§t±g*t±81k *3 7£0.90W±-C?n->fc3 
ttfcfcftfccDNA^ 75 y-£ff#SStSZfcfcqJflrefe3o £©£-5 fc* w< 
#Sfc$8^ftftcDNAfr£&ScDNA7>f 7-5 «J -©£3;S8Jj|£-& 

% Bjj & niter -5 fe sb(om&(Dmm 
»£;h,3&©-ey:&^o 

[«0<J 1 ] cDNA©HaR5IJ&3 H>4W5 7n ^7AOft» 
*^B^©cDNA©PlRHte=i h*>%^iWr57P77 cDNAiE^JKrJttf-^x. 
e>*ifcB#i3, *©EtJC^£ft5£T©ATG©it>a»e>g©MaRBMQ3 K>e>L^fc© 
4^jMtSo ^SUi, A) af@LfcATG©ii5ffl|©-^«Hfia (#10 

©*ft*ftfc:fcttaiflaR«J$e, LZvmmt, B) *@LfcATG©tt 
®©ffllRPH*fe3 K>£ L£©tt«*flHNTfr-5o fl§R^h#fflaR 
^A s ¥'J0^L-^v^?.la?|J§^i(ffll^T, fflHRMM£:§liRll#i=i K WtiSClE^J© 

^»&*tfiaiLT*<o T-n^Aii, ^tL?>©#tati«*fflv^THiRga*&3i 15 > 

>f J A^^7 V>^iJ7d77 Atfe^Gene Finder (Solovyev V.V., Salamov 1 
A. A. .Lawrence C.B. Predicting internal exons by oligonucleotide composit 
ion and discriminant analysis of spliceable open reading frames. Nucleic. 
Acids Res. (1994) 22: 5156-63.) ^-CfflU S>ftTl^iS»^J»J#ffr£ , ^$1© 

Tl^^-*-£^£^©IE#PM«2:T3) CW5o SP*>^ 

tiSo H^HI&s K>©l2£(i> H^H&s h*>?>L£©8IW-£©L£WI 
£jg;L.5fri:*9frT-*f-5#s 0&^ffil±*©&©##r©£tl-£Ji&bT* IP*k £ 

40%©y-rX^§1--^qJtg-Cfctl(i\ L#VMi0.6££Rffl-r;h,««fc^o fi^©;^/ 

±i£©AfcB©til«£^iWbUfc*ft*ft3o©««£m^o 
Hftfttckt A) *BUfcATG©P0ffl©-^eH1S« (»10-»1001gat) ©^ 

ft*ft£fctt3ffliR1E$&L£©tS«*:LT«U (DATGfr 7. h ^73 Y>tX 
(g^-e, ATG©300Jg£&£T?) C^S*iS6|gSi^©jS)gi»«, ©ATG©if^ 

50Jg*t^*tis6JgS3!c^©«itti«©M. ®^>y±;u^r^ ks> u£©i§& 

(ATG©&30T~ y|£ (90J&S) ©+-C, gfcll*tt&8^yg*:fci£©fifc!M4*B 
«0 £ffl^fc« ^fcs B) #iUfcATG©M3£©PBR«3 h' > L£©tS$8i: L 
TB\ ®ATG©14fiSB5*»e>5Je*gfcS'C©««trt©3Jfi*^S*tti: LfcSfr 
tT^J'fiffiv ©ATG©lutz|Bli;7l/-AT"ft!i©ATG^$ ) ^^^VN^ (fotUi'k 
tttfO) , ®ATG©361^Sfu^^7lMSHU*T"©pM^(3a^tiSv h $/>£&©£ 

[Hife0«2] *U:f***7fcfc:,fcacDNA©SM!L £ J;lflHiRltl*&:3 H >i^fl 
7D^7A (ATGpr) C«fc3#$r 
eratocarcinoma NT-2 N neuroblastoma SK-N-MC) <£ t> £^ (J.Sambrook.E.F.Fri 
tsch & T.Maniatis, Molecular Cloning Second edition, Cold Spring Harbor La 
boratory Press 1989) 13«&©;m£t' £ *3mRNA£#&mUfco U y 

7-u>*- (ib?'J#-# : i) *vj&i7#?9--r : 7>c*- {mm^: 

2) (aii:2©®^) * $tii7>n?m-r7^7- (ia?y« : 

3) (3t3hH4©t§£r) fcfflvvtXitt ^fift M M, 41, 
603-607 (1996). p606, Y.Suzuki et al., Gene, 200, 149-156(1997)) ©BEt&t&W 
BAPfflsSx TAPil, RNA^>f ^—>3 H-tgcDNA©£r$ *:RNA©l&ic£fTo;fc„ 
&ive, PCR£«fc !)2*gteDNAK:'£&U SfiliJJIgT Lfc 0 DralllT'OTb 
fc^?*-pME18SCG, pMFL^ (§© • AllJ, M6ft &m, 38, 472-481 (1 
993).p480) fccDNA©#fa££&»T*P— -^Lfco d tlti tJ^fcDNAto 
V^TDNAi' — ^ >i/>^M^(DyeTerminator Cycle Sequencing FS Ready Reacti 
on Kit, PE Applied Biosystems)£fflV^T - a7il/i: Lfctf^T v-^>v> 
^KJfogL DNA^>-^>-!t-(ABIPRISM 377, PE AppliedBiosysteis)-CDNAJSS& 
5iJ%»*fLfco &?n — >©5'£<fc»K ^ft^ft 10©^DNAIE?'J£8¥#Tbfco 

#^D->ODNAia5"Jt:^^Tx BII6LfccDNA©HIRia*&3 K>^$j7*P^A 
(ATGpr) 7?HS?BI>&3 K>A»#4-rS3b»S*to§ji¥«f Lfco <! ©Eft/r^P £^ 

(1) 3j-ijif#-v yr&-CcDNA*ft$LfccDNA©-5-£* :7> U -t 1 ' 

4 >^7 I/— Atf^— *T*ffi&©*D — >fc:o^T©ffli?Bli&:3 F>©8? 
tff 

The results for clones known from database entries to contain the translation initiation codon (F-NT2RP1000020, F-NT2RP1000025,
F-NT2RP1000039, F-NT2RP1000046) are shown below (Table 1). In F-NT2RP1000020 (880 bp), positions 88-690 showed 96% identity with
"Human neuron-specific gamma-2 enolase" (GenBank accession No. M22349); in F-NT2RP1000025 (645 bp), positions 29-64 showed 97%
identity with "Human alpha-tubulin mRNA" (GenBank accession No. K00558); in F-NT2RP1000039 (820 bp), positions 12-820 showed 96%
identity with "Human mRNA for elongation factor 1 alpha subunit (EF-1 alpha)" (GenBank accession No. X03558); and in
F-NT2RP1000046 (788 bp), positions 3-788 showed 97% identity with "Human M2-type pyruvate kinase mRNA" (GenBank accession No. M23725).

The nucleotide sequences of the 5'-end regions of these clones are shown in SEQ ID NOs: 4, 5, 6, and 7, respectively.

Table 1

         F-NT2RP1000020             F-NT2RP1000025             F-NT2RP1000039             F-NT2RP1000046
ATG No.  ATG position  ATGpr score  ATG position  ATGpr score  ATG position  ATGpr score  ATG position  ATGpr score
1        1             0.05         96            <0.94>       65            <0.90>       111           <0.94>
2        162           <0.84>       148           0.13         154           0.05         174           0.82
3        292           0.05         193           0.05         209           0.11         198           0.19
4        313           0.05         201           0.09         231           0.05         300           0.16
5        441           0.05         232           0.05         321           0.05         315           0.11



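(Note 1) < >: ATG judged to be the translation initiation codon.

(Note 2) ATG position: the position number of the ATG in the DNA sequence, counted from the 5'-end.
ATG No.: the ordinal number of the ATG in the DNA sequence, counted from the 5'-end.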
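
The position and numbering convention used in the tables (each ATG numbered in order and located by its distance from the 5'-end) can be sketched as below. The function name `find_atg_positions` is hypothetical; the code only enumerates ATG triplets and does not compute any ATGpr score.

```python
from typing import List


def find_atg_positions(sequence: str) -> List[int]:
    """Return the 1-based position of the A of every ATG, 5' to 3'."""
    # Drop the spaces that separate 10-base groups and normalise case.
    seq = "".join(sequence.split()).upper()
    positions = []
    start = seq.find("ATG")
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        start = seq.find("ATG", start + 1)
    return positions


if __name__ == "__main__":
    # First 70 bases of SEQ ID NO: 6 as printed in the sequence listing.
    read = (
        "CTTTTTTCGC AACGGGTTTG CCGCCAGAAC ACAGGTGTCG TGAAAACTAC "
        "CCCTAAAAGC CAAAATGGGA"
    )
    for atg_no, position in enumerate(find_atg_positions(read), start=1):
        print(atg_no, position)
```

With the bases as printed, the only ATG in this stretch is reported at position 65, which agrees with the first entry listed for F-NT2RP1000039.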

Si jF^TJ^tx ^- , j3"^rir-y7*ST'cDNA^^^L/tcDNA©a^> fCt* 
S©^o->to^TP^LfccDNA©illRPI^3 H>^iJ7-Dy^A (ATGpr) T* 

(2) Analysis of translation initiation codons in clones, among the cDNAs prepared by the oligo-capping method, that are known
from database entries not to contain the translation initiation codon

Among the clones analyzed, the results for clones known from database entries not to contain the translation initiation codon
(F-NT2RP1000013, F-NT2RP1000054, F-NT2RP1000122) are shown below (Table 2). In F-NT2RP1000013 (608 bp), positions 1-606 showed
97% identity with "Human nuclear matrix protein 55 (nmt55) mRNA" (GenBank accession No. U89867); in F-NT2RP1000054 (869 bp),
positions 1-869 showed 96% identity with "Human signal recognition particle (SRP54) mRNA" (GenBank accession No. U51920); and in
F-NT2RP1000122 (813 bp), positions 1-813 showed 98% identity with "H.sapiens mRNA for 2-5A binding protein" (GenBank accession
No. X76388). The nucleotide sequences of the 5'-end regions of these clones are shown in SEQ ID NOs: 8, 9, and 10, respectively.
Table 2

         F-NT2RP1000013             F-NT2RP1000054             F-NT2RP1000122
ATG No.  ATG position  ATGpr score  ATG position  ATGpr score  ATG position  ATGpr score
1        21            0.05         31            0.12         23            0.07
2        27            0.05         60            0.20         100           0.05
3        32            0.32         87            0.05         166           0.05
4        56            0.11         97            0.05         235           0.06
5        119           0.10         146           0.05         316           0.05
6        125           0.08         172           0.05         346           0.05
7        141           0.05         180           0.11         406           0.05
8        155           0.06         218           0.07         431           0.05
9        161           0.06         272           0.05         469           0.06
10       176           0.08         319           0.07         546           0.12
11       203           0.07         346           0.05         553           0.05
12       290           0.20         363           0.07         574           0.05
13       311           0.16         409           0.05
14       314           0.12         480           0.07



m 2 ifiT^TX 9l:t'J y :7y£T-cDNA£ML£cDNA©9 fZlz*- 
^±S©^P-XCO^TP^^LfecDNA©S3IRP^^i3 h* >^$J7"o A (ATGp 
(3) U:2**y7&TcDNA£ft$LfccDNA0-5fcN fr*Iffi?'J©? 
P-> (F-ZRV6C1000408, F-ZRV6C1000454. F-ZRV6C1000466, F-ZRV6C1000615, 
F-ZRV6C1000670) \Z"D^T<D#tii1£m*KXT£7p? (S3) o :*l^D 
->©5'*JgfflOlSSiB5!J4*n^tiiB5<J#^ : 1 1, 12, 13, 14, 15 
Table 3

         F-ZRV6C1000408             F-ZRV6C1000454             F-ZRV6C1000466
ATG No.  ATG position  ATGpr score  ATG position  ATGpr score  ATG position  ATGpr score
1        85            <0.94>       5             0.05         162           <0.86>
2        208           0.22         107           <0.87>       182           0.05
3        386           0.05         153           0.05         207           0.08
4        518           0.11         201           0.08         244           0.05
5        545           0.05         211           0.05         262           0.05
6                                   236           0.07         303           0.11

         F-ZRV6C1000615             F-ZRV6C1000670
ATG No.  ATG position  ATGpr score  ATG position  ATGpr score
1        85            <0.94>       120           <0.94>
2        208           0.26         187           0.54
3        386           0.05         312           0.06
4        518           0.09         388           0.05
5        545           0.05         445           0.05



As shown in Table 3, for F-ZRV6C1000408, F-ZRV6C1000454, F-ZRV6C1000466, F-ZRV6C1000615, and F-ZRV6C1000670, the "ATG" beginning
with the base "A" at positions 85, 107, 162, 85, and 120, respectively, was judged to be the translation initiation codon.

These clones were therefore judged to be full-length cDNA clones.
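
The way a single candidate is singled out per clone can be illustrated with the Table 3 values. The sketch below assumes, as the bracketed entries in the tables suggest, that the ATG with the highest ATGpr score in a clone is the one taken as the predicted initiation codon; `best_atg` and `TABLE_3` are hypothetical names, and the scores are transcribed from the table rather than computed.

```python
from typing import Dict, List, Tuple

# (ATG position, ATGpr score) pairs transcribed from Table 3 for two clones.
TABLE_3: Dict[str, List[Tuple[int, float]]] = {
    "F-ZRV6C1000408": [(85, 0.94), (208, 0.22), (386, 0.05), (518, 0.11), (545, 0.05)],
    "F-ZRV6C1000454": [(5, 0.05), (107, 0.87), (153, 0.05), (201, 0.08), (211, 0.05), (236, 0.07)],
}


def best_atg(candidates: List[Tuple[int, float]]) -> Tuple[int, float]:
    """Return the (position, score) pair with the highest score."""
    if not candidates:
        raise ValueError("clone has no ATG candidates")
    return max(candidates, key=lambda candidate: candidate[1])


if __name__ == "__main__":
    for clone_id, candidates in TABLE_3.items():
        position, score = best_atg(candidates)
        print(clone_id, position, score)
```

For these two clones the highest-scoring candidates sit at positions 85 and 107, the same positions named in the text for F-ZRV6C1000408 and F-ZRV6C1000454.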

(F-ZRV6C1001410, F-ZRV6C1001197, F-ZRV6C1001472) (ZO^T 

§^ti^niE?'J#^ : 1 6, 17, 18t^To 



Table 4

         F-ZRV6C1001410             F-ZRV6C1001197             F-ZRV6C1001472
ATG No.  ATG position  ATGpr score  ATG position  ATGpr score  ATG position  ATGpr score
1        23            0.05         5             0.24         77            0.25
2        31            0.07         141           0.25         126           0.05
3        71            0.06         202           0.05         149           0.05
4        178           0.05         219           0.05         194           0.05
5        214           0.05         228           0.05         213           0.22
6                                                              249           0.05
7                                                              338           0.09
8                                                              344           0.05
9                                                              351           0.05
10                                                             365           0.05



H4#^-f £?(CF-ZRV6C1001410. F-ZRV6C1001197. F-ZRV6C1001472(IO^T 
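1. A method for selecting a full-length cDNA clone, comprising:

(a) determining the nucleotide sequence from the 5'-end of the cDNA clones contained in a cDNA library;

(b) judging, by means of a program that predicts translation initiation codons, whether or not the nucleotide sequence
determined in (a) contains a translation initiation codon; and

(c) selecting the clones judged in (b) to contain a translation initiation codon.

2. The method according to claim 1, wherein the cDNA library has been prepared by a method for constructing a cDNA library
with a high proportion of full-length clones.

3. The method according to claim 1, wherein the cDNA library has been prepared by a method comprising a step of modifying the
CAP of mRNA.

4. A method for constructing a full-length cDNA library, comprising:

(a) determining the nucleotide sequence from the 5'-end of the cDNA clones contained in a cDNA library;

(b) judging, by means of a program that predicts translation initiation codons, whether or not the nucleotide sequence
determined in (a) contains a translation initiation codon;

(c) selecting the clones judged in (b) to contain a translation initiation codon; and

(d) mixing the clones selected in (c).

5. The method according to claim 4, wherein the cDNA library has been prepared by a method for constructing a cDNA library
with a high proportion of full-length clones.

6. The method according to claim 4, wherein the cDNA library has been prepared by a method comprising a step of modifying the
CAP of mRNA.

7. A full-length cDNA library obtainable by the method according to claim 4.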
SEQUENCE LISTING 
<110> Helix Research Institute, Inc. 

<120> The method for selecting full length cDNA clones 

<130> H1-806PCT 

<150> JP 09-289982 
<151> 1997-10-22 

<160> 18 

<170> PatentIn version 2.0 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligo-capping linker sequence 
<400> 1 

AGCAUCGAGU CGGCCUUGUU GGCCUACUGG 30 

<210> 2 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligo(dT) adapter primer sequence 
<400> 2 

GCGGCTGAAG ACGGCCTATG TGGCCTTTTT TTTTTTTTTT TT 42 

<210> 3 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Random adapter primer sequence 
<400> 3 

GCGGCTGAAG ACGGCCTATG TGGCCNNNNN NC 32

<210> 4 

<211> 880 
<212> DNA 
<213> Homo sapiens 

<400> 4 

ATGCGCCCGC GCGGCCCTAT AGGCGCCTCC TCCGCCCGCC GCCCGGGAGC CGCAGCCGCC 60 

GCCGCCACTG CCACTCCCGC TCTCTCAGCG CCGCCGTCGC CACCGCCACC GCCACTGCCA 120 

CTACCACCGT CTGAGTCTGC AGTCCCGAGA TCCCAGCCAT CATGTCCATA GAGAAGATCT 180 

GGGCCCGGGA GATCCTGGAC TCCCGCGGGA ACCCCACAGT GGAGGTGGAT CTCTATACTG 240 

CCAAAGGTCC TTTCCGGGCT GCAGTGCCCA GTGGAGCCTC TACGGGCATC TATGAGGCCC 300 

TGGAGCTGAG GGATGGAGAC AAACAGCGTT ACTTAGGCAA AGGTGTCCTG AAGGCAGTGG 360 

ACCACATCAA CTCCACCATC GCGCCAGCCC TCATCAGCTC AGGTCTCTCT GTGGTGGAGC 420 

AAGAGAAACT GGACAACCTG ATGCTGGAGT TGGATGGGAC TGAGAACAAA TCCAAGTTTG 480 

GGGCCAATCC ATCCTGGGTG TGTCTCTGGC CGTGTGTAAG GCANGGGCAA CTGAACNGGA 540 

ACTGCCCCTG TATCGCCACA TTGCTCAGCT TGGNCGGGAA CTCANACCTC ATCCTGCCTG 600 

TTGCCGGCCT TCAACGTGAT CAATGGTTGG CTTCTCATGC CTGGCAACAA ANCTGGCCAT 660 

TGCNGGAATT TTCATGATCC TCCCCNTTGG GAAACTGAAA AACTTTCCGG AATGCCCNTC 720 

CAACTAAGTT GCAAAAGGTC TACCNATACC CCCCAAGGGG AATTCCTCCA AGGGAACAAA 780 

TNCCCGGGAA AGGAATGCCC CCCAATTNTT NGGGGGAATA AAAGGTGGGC TTTGCCCCCC 840 

CATTTTCCTG GAAAAAACNA TNAAAACCCT TGGGAAACTT 880 
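
The listing prints each sequence in groups of ten bases with a running count at the end of each line. A minimal sketch for turning such lines back into one contiguous string, and for checking the result against the declared <211> length, is given below; `parse_listing_lines` and `matches_declared_length` are hypothetical helper names, not part of any sequence-listing tool.

```python
from typing import Iterable


def parse_listing_lines(lines: Iterable[str]) -> str:
    """Join the base groups of a printed sequence into one string.

    Purely numeric tokens are treated as running position counts and
    skipped; everything else is taken to be sequence characters.
    """
    groups = []
    for line in lines:
        for token in line.split():
            if token.isdigit():
                continue  # running count at the end of the line
            groups.append(token.upper())
    return "".join(groups)


def matches_declared_length(sequence: str, declared_length: int) -> bool:
    """Compare the parsed length with the value given under <211>."""
    return len(sequence) == declared_length


if __name__ == "__main__":
    # SEQ ID NO: 2 as printed in the listing (<211> 42).
    printed = ["GCGGCTGAAG ACGGCCTATG TGGCCTTTTT TTTTTTTTTT TT 42"]
    sequence = parse_listing_lines(printed)
    print(len(sequence), matches_declared_length(sequence, 42))
```

A length check of this kind is a quick way to spot lines that were split or merged when the listing was extracted.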

<210> 5 



<211> 645 
<212> DNA 

<213> Homo sapiens 

<400> 5 

TGTGCGTTAC TTACCTCNAC TCTTAGCTTG TCGGGGACGG TAACCGGGAC CCGGTGTCTG 60 

CTCCTGTCGC CTTCGCCTCC TAATCCCTAG CCACTATGCG TGAGTGCATC TCCATCCACG 120 

TTGGCCAGGC TGGTGTCCAN ATTGGCAATG CCTGCTGGGA GCTCTACTGC CTGGAACACG 180 

GCATCCAGCC CGATGGCCAG ATGCCAAGTG ACAAGACCAT TGGGGGAGGA GATGACTCCT 240 

TCAACACCTT CTTCAGTGAG ACGGGCGCTG GCAANCACGT GCCCCGGGCT GTGTTTGTAG 300 

ACTTGGAACC CACAGTCATT GATGAAGTTC GCACTGGCAC CTACCGCCAG CTCTTCCACC 360 

CTGAGCAGCT CATCNCAGGC AAGGAAGATG CTGCCAATAA CTATGCCCGA GGGCACTACA 420 

CCATTGGCAA GGAGATCATT GACCTTGTGT TGGACCGAAT TCGCAAGCTG GCTGACCANT 480 

GCACCGGTCT TCANGGCTTC TTGGTTTTCC ACAGCTTTGG TGGGGGAACT GGTTCTGGGT 540 

TCACCTCCCT GCTCATGGAA CGTCTCTCAG TTGATTATGG CAAGAAATCC AAGCTGGAGT 600 

TCTCCATTTA CCCAGCACCC CNGGTTTCCN CNGCTGTANT TNGAA 645 

<210> 6 

<211> 820 
<212> DNA 
<213> Homo sapiens 

<400> 6 

CTTTTTTCGC AACGGGTTTG CCGCCAGAAC ACAGGTGTCG TGAAAACTAC CCCTAAAAGC 60 

CAAAATGGGA AAGGAAAAGA CTCATATCAA CATTGTCGTC ATTGGACACG TAGATTCGGG 120 

CAAGTCCACC ACTACTGGCC ATCTGATCTA TAAATGCGGT GGCATCGACA AAAGAACCAT 180 
TGAAAAATTT GAGAAGGAGG CTGCTGAGAT GGGAAAGGGC TCCTTCAAGT ATGCCTGGGT 240 

CTTGGATAAA CTGAAAGCTG AGCGTGAACG TGGTATCACC ATTGATATCT CCTTGTGGAA 300 

ATTTGAGACC AGCAAGTACT ATGTGACTAT CATTGATGCC CCAGGACACA GAGACTTTAT 360 

CAAAAACATG ATTACAGGGA CATCTCAGGC TGACTGTGCT GTCCTGATTG TTGCTGCTGG 420 

TGTTGGTGAA TTTGAAGCTG GTATCTCCAA GAATGGGCAG ACCCGAGAGC ATGCCCTTCT 480 

GGCTTACACA CTGGGTGTGA AACAACTAAT TGTCGGTGTT AACAAAATGG ATTCACTGAN 540 

CCACCCTACA GCCAGAAGAA ATATGANGAA ATTGTTAAGG AAGTCAGCAC TTACATTAAG 600 

AAAATTGGCT ACAACCCCGA CACAGTANCA TTTGTGCCAA TTTCTGGTTG GAATGGTGAC 660 

AACATGCTGG AACCAANTGC TAACATGCCT TGGTTCCAGG GATGGAAAAT CCCCCNTTAA 720 

GGATGGCNAT GCCATTGGAA CCCCCCTGCT TGAAGGCTCT GGANTGCATC CTANCACCAA 780 

CTCCTTCAAA TTGAAAAACC CCTTGCNCCC GCCTCCNCCA 820 

<210> 7 

<211> 788 
<212> DNA 
<213> Homo sapiens 

<400> 7 

GAGGCTGAGG CAGTGGCTCC TTGCACAGCA GCTGCACGCG CCGTGGCTCC GGATCTCTTC 60 

GTCTTTGCAG CGTAGCCCGA GTCGGTCAGC GCCGGAGGAC CTCAGCAGCC ATGTCGAAGC 120 

CCCATAGTGA AGCCGGGACT GCCTTCATTC AGACCCAGCA GCTGCACGCA GCCATGGCTG 180 

ACACATTCCT GGAGCACATG TGCCGCCTGG ACATTGATTC ACCACCCATC ACAGCCCGGA 240 

ACACTGGCAT CATCTGTACC ATTGGCCCAG CTTCCCGATC AGTGGAGACG TTGAAGGAGA 300 

TGATTAAGTC TGGAATGAAT GTGGCTCGTC TGAACTTCTC TCATGGAACT CATGAGTACC 360 

ATGCGGAGAC CATCAAGAAT GTGCGCACAG CCACGGAAAG CTTTGCTTCT GACCCCATCC 420 
TCTACCGGCC CGTTGCTGTG GCTCTAGACA CTAAAGGACC TGAGATCCGA ACTGGGCTCA 480

TCAAGGGCAG CGGCACTGCA GAGGTGGAGC TGAAGAATGG AGCCACTCTC AAAATCACGC 540

TGGATAATGC CTACATGGAA AAGTGTGACG AGAACATCCT GTGGCTGGAC TACAAGAACA 600

TCTGCAAGGT GGTGGAAGTG GGCAACAAGA TCTACGTGGA TGATGGGCTN ATTTCTCTCC 660

AGGTGAACAC AAAGGTGCCG ACTTCCTGGG TGACNGANGT GGAAAATGGT GGCTCCTTGG 720

GCNCAAGAAA GGTGTGAACT TCCTGGGGCT GCTGTGGANT TGCCTGCTGT GTCNGAAAAA 780

GACATCCA 788



<210> 8 



<211> 608 
<212> DNA 

<213> Homo sapiens 



<400> 8 

ACAGCCTGGC TCCTTTGAGT ATGAATATGC CATGCGCTGG AAGGCACTCA TTGAGATGGA 60

GAAGCAGCAG CAGGACCAAG TGGACCGCAA CATCNAGGAG GCTCGTGAGA AGCTGGAGAT 120

GGAGATGGAA GCTGCACGCC ATGAGCACCA GGTCATGCTA ATGAGACAGG ATTTGATGAG 180

GCGCCAAGAA GAACTTCGGA GGATGGAAGA GCTGCACAAC CAAGANGTGC AAAAACGAAA 240

GCAACTGGAG CTCAGGCAGG AGGAANAGCG CAGGCGCCGT GAAGAANAGA TGCGGCGGCA 300

GCAAGAAGAA ATGATGCGGC GACNGCAGGA AGGATTCAAG GGAACCTTCC CTGATGCGAG 360

AGAGCAGGAG ATTCGGATGG GTCNGATGGC TATGGGAGGT GCTATGGGCA TAAACNACAG 420

ATGTGCCATG CCCCCTGCTC CTGTGCCAGC TGGTACCCCA GCTCCTCCAG GACCTGCCAC 480

TATTATGCCG GATGGAACTT TGGGATTGAC CCCACCNACA ACTGAACGCT TTGGTCNGGC 540

TGCTACNATG GAANGAATTG GGGCAATTGG TGGAACTCCT CCTGCATTCN ACCGTGCAGC 600

TCCTGGGA 608

<210> 9 

<211> 869 
<212> DNA 
<213> Homo sapiens 

<400> 9 

ATATTAAACT AGTGAAGCAA CTAAGAGAAA ATGTTAAGTC TGCTATTGAT CTTGAAGAGA 60

TGGCATCTGG TCTTAACAAA AGAAAAATGA TTCAGCATGC TGTATTTAAA GAACTTGTGA 120

AGCTTGTAGA CCCTGGAGTT AAGGCATGGA CACCCACTAA AGGAAAACAA AATGTGATTA 180

TGTTTGTTGG ATTGCAAGGG AGTGGTAAAA CAACAACATG TTCAAAGCTA GCATATTATT 240

ACCAGAGGAA AGGTTGGAAG ACCTGTTTAA TATGTGCAGA CACATTCAGA GCAGGGGCTT 300

TTGACCAACT AAAACAGAAT GCTACCAAAG CAAGAATTCC ATTTTATGGA AGCTATACAG 360

AAATGGATCC TGTCATCATT GCTTCTGAAG GAGTAGAGAA ATTTAAAAAT GAAAATTTTG 420

AAATTATTAT TGTTGATACA AGTGGCCGCC ACAAACAAGA AGACTCTTTG TTTGAAGAAA 480

TGCTTCAAGT TGCTAATGCT ATACAACCTG ATAACATTGT TTATGTGATG GATGCCTCCA 540

TTGGGCAGGC TTGTGAAGCC CAGGCTAAGG CTTTTAAAGA TAAAGTAGAT GTACCTCAGT 600

AATAGTGACA AAACTTGATG GCCATGCAAA ANGAAGTGGT GCACTCAGTG CAGTCGCTGC 660

CACAAAAAAT CCGATTATTT TCATTGGTAC AGGGGGAACA TATANATGAC TTTGAACCTT 720

TCAAAAACAC AGCCTTTTAT TAACAAACTT CTTGGTATNG GCGACATTGA AAGGACTGAT 780

AAATAAAGTC CACNAATTGA AATTTGGATG ACNATGNAAA CCCTTATTGA AAAAATTGAA 840

ACATNGTCCA GTTTTACTTT GCGAAACNT 869

<210> 10 

<211> 813 
<212> DNA 
<213> Homo sapiens 



<400> 10 

GTTGTGGTAT CTGTATTAAG AAATGCCCCT TTGGCGCCTT ATCAATTGTC AATCTACCAA 60

GCAACTTGGA AAAAGAAACC ACACATCGAT ATTGTGCCAA TGCCTTCAAA CTTCACAGGT 120

TGCCTATCCC TCGTCCAGGT GAAGTTTTGG GATTAGTTGG AACTAATGGT ATTGGAAAGT 180

CAACTGCTTT AAAAATTTTA GCAGGAAAAC AAAAGCCAAA CCTTGGAAAG TACGATGATC 240

CTCCTGACTG GCAGGAGATT TTGACTTATT TCCGTGGATC TGAATTACAA AATTACTTTA 300

CAAAGATTCT AGAAGATGAC CTAAAAGCCA TCATCAAACC TCAATATGTA GACCAGATTC 360

CTAAGGCTGC AAAGGGGACA GTGGGATCTA TTTTGGACCG AAAAGATGAA ACAAAGACAC 420

AGGCAATTGT ATGTCAGCAG CTTGATTTAA CCCACCTAAA AGAACGAAAT GTTGAAGATC 480

TTTCAGGAGG AGAGTTGCAG AGATTTGCTT GTGCTGTCGT TTGCATACAG AAAGCTGATA 540

TTTTCATGTT TGATGAGCCT TCTAGTTACC TAGATGTCAA GCAGCGTTTA AAGGCTGCTA 600

TTACTATACG ATCTCTAATA AATCCAGATA GATATATCAT TGTGGTGGAA CATGATCTAA 660

GTGTATTAGA CTATCTCTCC GACTTCATCT GCTGTTTATA TGGTGTACCA AGCGCCTATG 720

GAATTGTCAC TATGCCTTTT AGTGTTAGAA AAGGCATAAA CNTTTTTTGG ATGGGTATGT 780

TCCAACAGAA AACTTGANAA TCNNAAATGC NTC 813

<210> 11 

<211> 655 
<212> DNA 
<213> Homo sapiens 

<400> 11 

AGACTCTCAC CGCAGCGGCC AGGAACGCCA GCCGTTCACG CGTTCGGTCC TCCTTGGCTG 60 

ACTCACCGCC CTCGCCGCCG CACCATGGAC GCCCCCAGGC AGGTGGTCAA CTTTGGGCCT 120 

GGTCCCGCCA AGCTGCCGCA CTCAGTGTTG TTAGAGATAC AAAAGGAATT ATTAGACTAC 180 

AAAGGAGTTG GCATTAGTGT TCTTGAAATG AGTCACAGGT CATCAGATTT TGCCAAGATT 240 

ATTAACAATA CAGAGAATCT TGTGCGGGAA TTGCTAGCTG TTCCAGACAA CTATAAGGTG 300 

ATTTTTCTGC AAGGAGGTGG GTGCGGCCAG TTCAGTGCTG TCCCCTTAAA CCTCATTGGC 360 

TTGAAAGCAG GAAGGTGTGC GGACTATGTG GTGACAGGAG CTTGGTCAGC TAAGGCCGCA 420 

GAAGAAGCCA AGAAGTTTGG GACTATAAAT ATCGTTCACC CTAAACTTGG GAGTTATACA 480 

AAAATTCCAG ATCCAAGCAC CTGGAACCTC AACCCAMTG CCTCCTACGT GTTTTATTGC 540 

NCAAATGAAA CGGTGCATGG TGTTGANTTT GACTTTATAC CCNATGTCAA GGGAACANTA 600 

CTGGTTTGTG ACATTTTCCT CCAACTTCCT GTCCAANCCA ATTGNATGTT TCCAA 655 

<210> 12 

<211> 599 
<212> DNA 
<213> Homo sapiens 

<400> 12 

AAAGATGCGC AGGCGCCGTG TGGCACTCGG CGGTCGAAAG GGGAGTTCAA GGAGACGGGG 60 

GCGACGCGGC TGAGGGCTTC TCGTCGGGGT CGGGGCTGCA GCCGTCATGC CGGGGATAGT 120 

GGAGCTGCCC ACTCTAGAGG AGCTGAAAGT AGATGAGGTG AAAATTAGTT CTGCTGTGCT 180 

TAAAGCTGCG GCCCATCACT ATGGAGCTCA ATGTGATAAG CCCAACAAGG AATTTATGCT 240 

CTGCCGCTGG GAANAGAAAG ATCCGAGGCG GTGCTTAGAG GAAGGCAAAC TGGTCAACAA 300 
GTGTGCTTTG GACTTCTTTA GGCAGATAAA ACGTCACTGT GCAGAGCCTT TTACAGAATA 360 
TTGGACTTGC ATTGATTATA CTGGCCAGCA GTTATTTCGT CACTGTCGCA AACAGCAGGC 420 

AAAGTTTGAC NAGTGTGTGC TGGACAAACT GGGCTGGGTG CGGCCTGACC TGGGAAAACT 480 

GTCAAAGGTC ACCAAAGTGA AAACAGATCN ACCTTTACCG GANAATCCCT ATCACTCAAG 540 

AACAAGAACG GATCCCAGCC CTGANATCNA AGGAAATCTG CANCCTGCCA CACATGGCA 599 

<210> 13 

<211> 597 
<212> DNA 
<213> Homo sapiens 

<400> 13 

ATATCCGGAG TAGACGGAGC CGCAGTAGAC GGATCCGCGG CTGCACCAAA CACTGCCCCT 60 

CGGAGCCTGG TAGTGGGCCA CAAGCCCCCA GTCCCAGAGG CGTGATTTTC TGGCATCCTT 120 

AAATCTTGTG TCAAGGATTG GTTATAATAT AACCAGAAAC CATGACGGCG GCTGAGAACG 180 

TATGCTACAC GTTAATTAAC GTGCCAATGG ATTCAGAACC ACCATCTGAA ATTAGCTTAA 240 

AAAATGATCT AGAAAAAGGA GATGTAAAGT CAAAGACTGA AGCTTTGAAG AAAGTAATCA 300 

TTATGATTCT GAATGGTGAA AAACTTCCTG GACTTCTGAT GACCATCATT CGTTTTGTGC 360 

TACCTCTTCA GGATCACACT ATCAAGAAAT TACTTCTGGT ATTTTGGGAG ATTGTTCCTA 420 

AAACAACTCC AGATGGGAGA CTTTTACATG AGATGATCCT TGTATGTGAT GCATACAGAA 480 

AGGATCTTCA ACATCCTAAT GAATTTATTC NAAGGATCTA CTCTTCGTTT TCTTTGCAAA 540 

TTGAAANAAA CANAATTGCT AAAACCTTTA ATGCCANCTA TNCCTGCATT TTTGGGA 597 

<210> 14 



<211> 634 
<212> DNA 

<213> Homo sapiens 

<400> 14 

AGACTCTCAC CGCAGCGGCC AGGAACGCCA GCCGTTCACG CGTTCGGTCC TCCTTGGCTG 60 

ACTCACCGCC CTCGCCGCCG CACCATGGAC GCCCCCAGGC AGGTGGTCAA CTTTGGGCCT 120 

GGTCCCGCCA AGCTGCCGCA CTCAGTGTTG TTAGAGATAC AAAAGGAATT ATTAGACTAC 180 

AAAGGANTTG GCATTAGTGT TCTTGAAATG AGTCACAGGT CATCAGATTT TGCCAAGATT 240 

ATTAACAATA CAGAGAATCT TGTGCGGGAA TTGCTAGCTG TTCCAGACAA CTATAAGGTG 300 

ATTTTTCTGC AAGGAGGTGG GTGCGGCCAG TTCAGTGCTG TCCCCTTAAA CCTCATTGGC 360 

TTGAAAGCAG GAANGTGTGC GGACTATGTG GTGACAGGAG CTTGGTCAGC TAAGGCCGCA 420 

NAANAAGCCA AGAANTTTGG GACTATAAAT ATCGTTCACC CTAAACTTGG GAGTTATACA 480 

AAAATTCCAG ATCCAAGCAC CTGGAACCTC AACCCAGATG CCTCCTACGT GTATTATTGC 540 

GCNAATGAAA CNGTGCATGG TGTGGANTCT GACTTTATAC CCGATGTCNA GGGAACATAC 600 

TGGTTTGTGA CATGTCCTCA AACTTCCCGT CCNA 634 

<210> 15 

<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 15 

AGTCTGCGGT GGGCTANCGG ACGGTCCGGC TTCCGGCGGC CGTTTCTGTC TCTTGCTGGC 60 

TGTCTCGCTG AATCGCGGCC GCCTTCTCAT CGCTCCTGGA AGGTCCCGAG CGCGACACCA 120 

TGTCGGAACC CGGGGGCGGC GGCGGCGAAG ACNGCTCGGC CGGATTGGAA GTGTCGGCCG 180 
TGCANAATGT GGCGGACGTG TCGGTGCTGC ANAAGCACCT GCGCAAGCTG GTGCCGCTGC 240 

TGCTGGAGGA CGGCGGCGAA GCGCCGGCCG CGCTGGAGGC GGCGCTGGAG GAGAAGAGCG 300 

CCCTGGAGCA GATGCGCAAG TTCCTTTCGG ACCCGCACGT CCACACGGTG CTGGTGGAGC 360 

GCTCCACGCT CAAAGTGGAC GTCGGTGATG AAGGAGAAGA AGAAAAAGAA TTCATTTCCT 420 

ATAACATCAA CNTAGACATT CACTATGGGG TTAAATCCAA TAGCTTGGCA TTCATTAAAC 480 

GTACTCCCGT GATTGATGCA GATAAACCCG TGTCTTCTCA NCTCCGGGTC CTTACACTCA 540 

GTGAANACTC NCCCTACNAA AACTTTGCAT TCTTTCATTA ACAATGCAGT GGCTCCTTTT 600 

TTTAANTCCT ACATTAAAAA ATCTGGCAAG GCAAACAGGG ATGGTGATAA AATGGCTCCT 660 

TCCNTTGAAA AAAAAATTGC CGAACTCNAA ATNGGACTCC TTCCCTTGCA NCAAAATTTT 720 

TGAAATTCCG GAAAATCANC CTGCCCAATT CCTCCCC 757 

<210> 16 

<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 16 

ATCATTTCCT TATTTATATT TCATGTTGGA ATGCTTAAAT CGATAACCTT TGTATTTTGA 60 

AGTGCGCGAC ATGGAAGGTG ATCTGCAAGA GCTGCATCAG TCAAACACCG GGGGATAAAT 120 

CTGGATTTGG GTTCCGGCGT CAAGGTGAAG ATAATACCTA AAGAGGAACA CTGTAAAATG 180 

CCAGAAGCAG GTGAANAGCA ACCACAAGTT TAAATGAAGA CAAGCTGAAA CAACGCAAGC 240 

TGGTTTTATA TTAGATATTT GACTTAAACT ATCTCAATAA AGTTTTGCAG CTTTCACCAC 300 



<210> 17 
<211> 313 
<212> DNA 

<213> Homo sapiens 



<400> 17 

AAAGATGGCG GCGGGGGAGG TAGGCAGAGC AGGACGCCGC TGCTGCCGCC GCCACCGCCG 60 

CCTCCGCTCC AGTCGCCTCC GGTCCTTCAA ACTCACACCT CCCGGGAGGA GCTGTCCTGG 120 

CGCCGGGTCC CGCGGGGAAA ATGGTGGAGC CAGGGCAAGA TTTACTGCTT GCTGCTTTGA 180 

GTGAGAGTGG AATTAGTCCG AATGACTCTT TGATATTGAT GGTGGAGATG CANGGCTTGC 240 

AACTCCAATG CCTACCCCGT CAGTTCAGCA NTCAGTGCCA CTTANTGCAT TANAACTANG 300 

TTTGGAGACC GAA 313 



<210> 18 



<211> 667 
<212> DNA 

<213> Homo sapiens 
<400> 18 

ACTGCCGGGC TCGGCGTGAG TCGCTGCGGG GCTGACGGGG TGGCAGTGCG GCGGGTTACG 60 
GCCTGGTCAG ACCATAATGA CTTCAGCAAA TAAAGCAATC GAATTACAAC TACAAGTGAA 120 
ACAAAATGCA GAAGAATTAC AAGACTTTAT GCGGGATTTA GAAAACTGGG AAAAAGACAT 180 
TAAACAAAAG GATATGGAAC TAAGAAGACA GAATGGTGTT CCTGAAGAGA ATTTACCTCC 240 
TATTCGAAAT GGGAATTTTA GGAAAAAGAA GAAAGGCAAA GCTAAAGAGT CTTCCCCAAA 300 
ACCANAGAGG AAAACACNAA AAACAGGATA AAATCTTATG ATTATGANGC ATGGGCAAAA 360 
CTTGATGTGG ACCGTATCCT TGATGAGCTT GACAAAGACG ATAGTACCCA TGAGTCTCTG 420 
TCTCAAGAAT CAGAGTCGGA AGAAGATGGG ATTCATGTTG ATTCNCNAAA GGCTCTTGTT 480
TTAAAAGAAA AGGGCNATAA ATACTTCCAC AAGGAAAATA TGATGAAGCA ATTGACTGCT 540
ACACNAAAGG CNTGGATGCC GATCCATATN ATCCCGTGTT GCCAACGAAC ANAACNTCCG 600
CATATTTTAG ACTGAAAAAA TTTGCTGTTG CTGAATCTGA TTGTTATTTA NCANTTGCCT 660
TGAAATA 667



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/JP98/04772 



A. CLASSIFICATION OF SUBJECT MATTER 
Int. Cl.6 C12N15/10 

According to International Patent Classification (IPC) or to both national classification and IPC 
B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 
Int. Cl.6 C12N15/10 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
BIOSIS (DIALOG), WPI/L (DIALOG) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 

Category*  Citation of document, with indication, where appropriate, of the relevant passages  Relevant to claim No. 

WO, 94/08001, A1 (The Kanagawa Academy of Science),  1-7
14 April, 1994 (14. 04. 94)
& JP, 6-153953, A & EP, 625572, A1

WO, 96/34981, A2 (GENSET),  1-7
7 November, 1996 (07. 11. 96)
& EP, 824598, A2 & FR, 2733762, A1
& FR, 2733765, A1

Maruyama, K., et al., "Oligo-capping: a simple method  1-7
to replace the cap structure of eukaryotic mRNAs with
oligoribonucleotides", Gene, Vol. 138 (1994),
p. 171-174

Kato, S., et al., "Construction of a human full-length  1-7
cDNA bank", Gene, Vol. 150 (1994), p. 243-250

Carninci, P., et al., "High-Efficiency Full-Length  1-7
cDNA Cloning by Biotinylated CAP Trapper", Genomics,
Vol. 37 (1996), p. 327-336



[X] Further documents are listed in the continuation of Box C.    See patent family annex. 

* Special categories of cited documents: 

"A" document defining the general state of the art which is not considered to be of particular relevance 
"E" earlier document but published on or after the international filing date 
"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified) 
"O" document referring to an oral disclosure, use, exhibition or other means 
"P" document published prior to the international filing date but later than the priority date claimed 

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention 
"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone 
"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art 
"&" document member of the same patent family 



Date of the actual completion of the international search 

13 January, 1999 (13. 01. 99) 


Date of mailing of the international search report 

26 January, 1999 (26. 01. 99) 


Name and mailing address of the ISA/ 

Japanese Patent Office 
Facsimile No. 


Authorized officer 
Telephone No. 



Form PCT/ISA/210 (second sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/JP98/04772 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* Citation of document, with indication, where appropriate, of the relevant passages Relevant to claim No. 

A Edery, I., et al., "An Efficient Strategy To Isolate 1-7 
Full-Length cDNAs Based on an mRNA Cap Retention 
Procedure (CAPture)", Molecular and Cellular 
Biology, Vol. 15 (1995), p. 3363-3371 

A Solovyev, V., et al., "Predicting internal exons by 1-7 
oligonucleotide composition and discriminant 
analysis of spliceable open reading frames", Nucleic 
Acids Research, Vol. 22 (1994), No. 24, p. 5156-5163 

A Heindell, H.C., et al., "The Primary Sequence of 1-7 
Rabbit α-Globin mRNA", Cell, Vol. 15 (1978), p. 43-54 

A Minoru Suzuki et al., "RT-PCR Process: Cloning of 5' 1-7 
end of mRNA by Oligocapping Procedure (in Japanese)", 
Protein, Nucleic Acid and Enzyme, Vol. 41, No. 5 
(1996), p. 603-607 

A Sumio Sugano et al., "Aiming at Full-length cDNA 1-7 
Library: Substitution of Capped Structure by 
Oligonucleotide (in Japanese)", Protein, Nucleic 
Acid and Enzyme, Vol. 38, No. 3 (1993), p. 476-481 

A Carninci, P., et al., "High Efficiency Selection of 1-7 
Full-length cDNA by Improved Biotinylated Cap 
Trapper", DNA Research, Vol. 4, No. 1 (1997), p. 61-66 



Form PCT/ISA/210 (continuation of second sheet) (July 1992) 



Name and mailing address of the ISA/JP: Japanese Patent Office, postal code 100-8915 
Telephone No. 03-3581-1101, ext. 3448 
Authorized officer: 4B 8827 






